7 research outputs found

    Algorithm selection using edge ML and case-based reasoning

    Get PDF
    In practical data mining, a wide range of classification algorithms is employed for prediction tasks. However, selecting the best algorithm poses a challenging task for machine learning practitioners and experts, primarily due to the inherent variability in the characteristics of classification problems, referred to as datasets, and the unpredictable performance of these algorithms. Dataset characteristics are quantified in terms of meta-features, while classifier performance is evaluated using various performance metrics. The assessment of classifiers through empirical methods across multiple classification datasets, while considering multiple performance metrics, presents a computationally expensive and time-consuming obstacle in the pursuit of selecting the optimal algorithm. Furthermore, the scarcity of sufficient training data, denoted by dimensions representing the number of datasets and the feature space described by meta-feature perspectives, adds further complexity to the process of algorithm selection using classical machine learning methods. This research paper presents an integrated framework called eML-CBR that combines edge edge-ML and case-based reasoning methodologies to accurately address the algorithm selection problem. It adapts a multi-level, multi-view case-based reasoning methodology, considering data from diverse feature dimensions and the algorithms from multiple performance aspects, that distributes computations to both cloud edges and centralized nodes. On the edge, the first-level reasoning employs machine learning methods to recommend a family of classification algorithms, while at the second level, it recommends a list of the top-k algorithms within that family. This list is further refined by an algorithm conflict resolver module. The eML-CBR framework offers a suite of contributions, including integrated algorithm selection, multi-view meta-feature extraction, innovative performance criteria, improved algorithm recommendation, data scarcity mitigation through incremental learning, and an open-source CBR module, reshaping research paradigms. The CBR module, trained on 100 datasets and tested with 52 datasets using 9 decision tree algorithms, achieved an accuracy of 94% for correct classifier recommendations within the top k=3 algorithms, making it highly suitable for practical classification applications

    Multicriteria Decision Making Using TOPSIS Method: An Accurate Selection of Deserving Candidates

    Get PDF
    In intelligent decision support systems, multi-criteria decision-making techniques have widely been used for various kinds of executive decisions. Under the United Nation poverty alleviation programs, Non-Governmental (NGO), Non-Profitable Organizations (NPO) and privileged people use survey methods to collect deserving people’s data for charitable donations, for example financial aid etc. They use manual or semi-automatic data collection methods and then consider very few criteria to finalize the list of deserving donees for the aid in hand. This normally results in a list including donees who either don’t deserve or if so, they are not the right candidates. In addition to this, final recommendations of the NGOs and NPOs don’t address poverty gap of the donees and place them all at the same level which results in unfair distribution of the charity. This makes the problem as a complex decision-making process due to the consideration of multiple criteria at a time. To address these issues, this paper formulates the problem of deserving donees selection as a multi-criteria decision-making problem. A novel multi-criteria decision-making methodology, integrating Analytical Hierarchy Process (AHP) with Technique for Order Preference by Similarity to Ideal Solution(TOPSIS), is proposed to rank the candidates list according to their intensity of poverty gap. To realize the proposed methodology, a set of 15 unique criteria are identified and data is collected from 212 subjects, which are then ranked accordingly. The results show that the methodology is assistive to managements of NPOs and privileged individuals to accurately rank donees and consequently disburse aids in most deserving families

    Large scale image dataset construction using distributed crawling with hadoop YARN

    Get PDF
    © 2018 IEEE. with the rapid advancement in the internet, we are now living in the era of big data. The image data over the web has the potential to assist in the development of sophisticated and robust models and algorithms to interact with images and multimedia data. Images Data sets are widely used in image processing tasks and analyses. They are in use in various fields including Artificial Intelligence, Data extraction and collection, Computer Vision, Research studies and education. In this research work, we are going to propose a system that crawls the web in a systematic manner using Hadoop MapReduce technique to collect images from millions of web pages on the web. With Celebrity images just a use case, we would be able to search for and retrieve any image tagged with some specific terms. The system uses some simple techniques to reduce noisy images like thumbnails and icons. The proposed system is based on Apache Hadoop and Apache Nutch, an open source web crawler. A customized crawl is run through Apache Nutch in a Hadoop Cluster that searches images for one or more categories on the web and retrieves their links. Next, HIPI, Hadoop Image Processing Interface is used to download the images and create datasets for an individual category or a dataset of multiple categories

    Respiratory support in patients with severe COVID-19 in the International Severe Acute Respiratory and Emerging Infection (ISARIC) COVID-19 study: a prospective, multinational, observational study

    No full text
    Background: Up to 30% of hospitalised patients with COVID-19 require advanced respiratory support, including high-flow nasal cannulas (HFNC), non-invasive mechanical ventilation (NIV), or invasive mechanical ventilation (IMV). We aimed to describe the clinical characteristics, outcomes and risk factors for failing non-invasive respiratory support in patients treated with severe COVID-19 during the first two years of the pandemic in high-income countries (HICs) and low middle-income countries (LMICs). Methods: This is a multinational, multicentre, prospective cohort study embedded in the ISARIC-WHO COVID-19 Clinical Characterisation Protocol. Patients with laboratory-confirmed SARS-CoV-2 infection who required hospital admission were recruited prospectively. Patients treated with HFNC, NIV, or IMV within the first 24 h of hospital admission were included in this study. Descriptive statistics, random forest, and logistic regression analyses were used to describe clinical characteristics and compare clinical outcomes among patients treated with the different types of advanced respiratory support. Results: A total of 66,565 patients were included in this study. Overall, 82.6% of patients were treated in HIC, and 40.6% were admitted to the hospital during the first pandemic wave. During the first 24 h after hospital admission, patients in HICs were more frequently treated with HFNC (48.0%), followed by NIV (38.6%) and IMV (13.4%). In contrast, patients admitted in lower- and middle-income countries (LMICs) were less frequently treated with HFNC (16.1%) and the majority received IMV (59.1%). The failure rate of non-invasive respiratory support (i.e. HFNC or NIV) was 15.5%, of which 71.2% were from HIC and 28.8% from LMIC. The variables most strongly associated with non-invasive ventilation failure, defined as progression to IMV, were high leukocyte counts at hospital admission (OR [95%CI]; 5.86 [4.83-7.10]), treatment in an LMIC (OR [95%CI]; 2.04 [1.97-2.11]), and tachypnoea at hospital admission (OR [95%CI]; 1.16 [1.14-1.18]). Patients who failed HFNC/NIV had a higher 28-day fatality ratio (OR [95%CI]; 1.27 [1.25-1.30]). Conclusions: In the present international cohort, the most frequently used advanced respiratory support was the HFNC. However, IMV was used more often in LMIC. Higher leucocyte count, tachypnoea, and treatment in LMIC were risk factors for HFNC/NIV failure. HFNC/NIV failure was related to worse clinical outcomes, such as 28-day mortality. Trial registration This is a prospective observational study; therefore, no health care interventions were applied to participants, and trial registration is not applicable

    Respiratory support in patients with severe COVID-19 in the International Severe Acute Respiratory and Emerging Infection (ISARIC) COVID-19 study: a prospective, multinational, observational study

    No full text
    Background: Up to 30% of hospitalised patients with COVID-19 require advanced respiratory support, including high-flow nasal cannulas (HFNC), non-invasive mechanical ventilation (NIV), or invasive mechanical ventilation (IMV). We aimed to describe the clinical characteristics, outcomes and risk factors for failing non-invasive respiratory support in patients treated with severe COVID-19 during the first two years of the pandemic in high-income countries (HICs) and low middle-income countries (LMICs). Methods: This is a multinational, multicentre, prospective cohort study embedded in the ISARIC-WHO COVID-19 Clinical Characterisation Protocol. Patients with laboratory-confirmed SARS-CoV-2 infection who required hospital admission were recruited prospectively. Patients treated with HFNC, NIV, or IMV within the first 24 h of hospital admission were included in this study. Descriptive statistics, random forest, and logistic regression analyses were used to describe clinical characteristics and compare clinical outcomes among patients treated with the different types of advanced respiratory support. Results: A total of 66,565 patients were included in this study. Overall, 82.6% of patients were treated in HIC, and 40.6% were admitted to the hospital during the first pandemic wave. During the first 24 h after hospital admission, patients in HICs were more frequently treated with HFNC (48.0%), followed by NIV (38.6%) and IMV (13.4%). In contrast, patients admitted in lower- and middle-income countries (LMICs) were less frequently treated with HFNC (16.1%) and the majority received IMV (59.1%). The failure rate of non-invasive respiratory support (i.e. HFNC or NIV) was 15.5%, of which 71.2% were from HIC and 28.8% from LMIC. The variables most strongly associated with non-invasive ventilation failure, defined as progression to IMV, were high leukocyte counts at hospital admission (OR [95%CI]; 5.86 [4.83–7.10]), treatment in an LMIC (OR [95%CI]; 2.04 [1.97–2.11]), and tachypnoea at hospital admission (OR [95%CI]; 1.16 [1.14–1.18]). Patients who failed HFNC/NIV had a higher 28-day fatality ratio (OR [95%CI]; 1.27 [1.25–1.30]). Conclusions: In the present international cohort, the most frequently used advanced respiratory support was the HFNC. However, IMV was used more often in LMIC. Higher leucocyte count, tachypnoea, and treatment in LMIC were risk factors for HFNC/NIV failure. HFNC/NIV failure was related to worse clinical outcomes, such as 28-day mortality. Trial registration This is a prospective observational study; therefore, no health care interventions were applied to participants, and trial registration is not applicable

    Characteristics and outcomes of an international cohort of 600 000 hospitalized patients with COVID-19

    Get PDF
    Background: We describe demographic features, treatments and clinical outcomes in the International Severe Acute Respiratory and emerging Infection Consortium (ISARIC) COVID-19 cohort, one of the world's largest international, standardized data sets concerning hospitalized patients. Methods: The data set analysed includes COVID-19 patients hospitalized between January 2020 and January 2022 in 52 countries. We investigated how symptoms on admission, co-morbidities, risk factors and treatments varied by age, sex and other characteristics. We used Cox regression models to investigate associations between demographics, symptoms, co-morbidities and other factors with risk of death, admission to an intensive care unit (ICU) and invasive mechanical ventilation (IMV). Results: Data were available for 689 572 patients with laboratory-confirmed (91.1%) or clinically diagnosed (8.9%) SARS-CoV-2 infection from 52 countries. Age [adjusted hazard ratio per 10 years 1.49 (95% CI 1.48, 1.49)] and male sex [1.23 (1.21, 1.24)] were associated with a higher risk of death. Rates of admission to an ICU and use of IMV increased with age up to age 60 years then dropped. Symptoms, co-morbidities and treatments varied by age and had varied associations with clinical outcomes. The case-fatality ratio varied by country partly due to differences in the clinical characteristics of recruited patients and was on average 21.5%. Conclusions: Age was the strongest determinant of risk of death, with a ∼30-fold difference between the oldest and youngest groups; each of the co-morbidities included was associated with up to an almost 2-fold increase in risk. Smoking and obesity were also associated with a higher risk of death. The size of our international database and the standardized data collection method make this study a comprehensive international description of COVID-19 clinical features. Our findings may inform strategies that involve prioritization of patients hospitalized with COVID-19 who have a higher risk of death

    ISARIC-COVID-19 dataset: A Prospective, Standardized, Global Dataset of Patients Hospitalized with COVID-19

    No full text
    The International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) COVID-19 dataset is one of the largest international databases of prospectively collected clinical data on people hospitalized with COVID-19. This dataset was compiled during the COVID-19 pandemic by a network of hospitals that collect data using the ISARIC-World Health Organization Clinical Characterization Protocol and data tools. The database includes data from more than 705,000 patients, collected in more than 60 countries and 1,500 centres worldwide. Patient data are available from acute hospital admissions with COVID-19 and outpatient follow-ups. The data include signs and symptoms, pre-existing comorbidities, vital signs, chronic and acute treatments, complications, dates of hospitalization and discharge, mortality, viral strains, vaccination status, and other data. Here, we present the dataset characteristics, explain its architecture and how to gain access, and provide tools to facilitate its use
    corecore